QUIS: In-Situ Heterogeneous Data Source Querying

Authors

  • Javad Chamanara
  • Birgitta König-Ries
  • H. V. Jagadish
Abstract

Existing data integration frameworks are poorly suited to the special requirements of scientists. To answer a specific research question, excerpts of data from different sources often need to be integrated. The relevant parts and the set of underlying sources may differ from query to query. The analyses also often involve frequently changing data and exploratory querying. Additionally, the data sources not only store data in different formats, but also provide inconsistent data access functionality. The classic Extract-Transform-Load (ETL) approach seems too complex and time-consuming and does not fit well with the interests and expertise of scientists. With QUIS (QUery In-Situ), we provide a solution to this problem. QUIS is an open source heterogeneous in-situ data querying system. It utilizes a federated query virtualization approach that is built upon plugged-in adapters. QUIS takes a user query, transforms appropriate portions of it into the corresponding computation model of each individual data source, and executes it. It complements the segments of the query that the target data sources cannot execute. Hence, it guarantees full syntactic and semantic support for its language on all data sources. QUIS's in-situ querying facility almost eliminates the time needed to prepare the data while maintaining competitive performance and steady scalability. The present demonstration illustrates interesting features of the system: virtual schemas, heterogeneous joins, and visual query results. We provide a realistic data processing scenario to examine the system's features. Users can interact with QUIS using its desktop workbench, its command line interface, or any R client, including RStudio Server.
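
The idea of running the parts of a query that a source supports natively and complementing the rest can be illustrated with a minimal Python sketch. It is not QUIS code: the `Query` type, the two adapter classes, the `capabilities` sets, and the `execute` function are all hypothetical names chosen for illustration, and in-memory lists stand in for real data sources.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Row = Dict[str, object]


@dataclass
class Query:
    """A toy query: one filter step followed by one projection step."""
    predicate: Callable[[Row], bool]
    columns: List[str]


class ScanOnlyAdapter:
    """Hypothetical adapter for a source that can only list its rows (e.g. a flat file)."""
    capabilities = {"scan"}

    def __init__(self, rows: List[Row]):
        self.rows = rows

    def scan(self):
        return iter(self.rows)


class FilteringAdapter(ScanOnlyAdapter):
    """Hypothetical adapter for a source that can also filter natively (e.g. a DBMS)."""
    capabilities = {"scan", "filter"}

    def filter(self, predicate: Callable[[Row], bool]):
        # Stands in for translating the predicate into the source's own query language.
        return (row for row in self.rows if predicate(row))


def execute(query: Query, adapter) -> List[Row]:
    """Delegate what the adapter supports and complement the rest in the engine."""
    if "filter" in adapter.capabilities:
        rows = adapter.filter(query.predicate)  # delegated to the source
    else:
        rows = (r for r in adapter.scan() if query.predicate(r))  # complemented
    # Neither toy adapter supports projection, so the engine always applies it.
    return [{col: row[col] for col in query.columns} for row in rows]


data = [{"site": "A", "temp": 12.5}, {"site": "B", "temp": 19.0}]
query = Query(predicate=lambda r: r["temp"] > 15, columns=["site"])
print(execute(query, ScanOnlyAdapter(data)))
print(execute(query, FilteringAdapter(data)))
```

Both calls return the same rows even though the two adapters expose different capabilities, which is the sense in which a query language can be fully supported on every source in this sketch.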

Similar resources

Query Processing and Optimisation in Integrated Heterogeneous Grid Resources

The performance of Grid computing technologies for distributed data access and query processing has been investigated in a number of studies. However, different Grid data sources may have schema conflicts which require fine-grained resolution through the use of data integration technologies that are not supported by the current generation of Grid data access and querying middleware. This is par...

UniQue: An Approach for Unified and Efficient Querying of Heterogeneous Web Data Sources

Governments, organizations, and people are publishing open data on the Web more than ever before. To consume the data, however, requires substantial effort from web mashup developers, as they have to familiarize themselves with a diversity of data formats and query techniques specific to each data source. While several solutions have been proposed to improve web querying, none of them covers af...

SEEDEEP: A System for Exploring and Querying Scientific Deep Web Data Sources

A recent and emerging trend in scientific data dissemination involves online databases that are hidden behind query forms, thus forming what is referred to as the deep web. In this paper, we propose SEEDEEP, a System for Exploring and quErying scientific DEEP web data sources. SEEDEEP is able to automatically mine deep web data source schemas, integrate heterogeneous data sources, answer cross-...

ROHDIP: Resource Oriented Heterogeneous Data Integration Platform

During the last few years, the rise of social networks such as Facebook, Twitter, and Instagram has led to a daily increase in data that are heterogeneous in their sources, data models, and platforms. Heterogeneous data sources take many forms, such as the WWW, the deep web, relational database systems, NoSQL database systems, hierarchical data systems, and semistructured files, in which data are us...

Querying Heterogeneous XML Sources through a Conceptual Schema

XML is a widespread W3C standard used by several kinds of applications for data representation and exchange over the web. In the context of a system that provides semantic integration of heterogeneous XML sources, the same information at a semantic level may have different representations in XML. However, the syntax of an XML query depends on the structure of the specific XML source. Therefore,...

Journal:
  • PVLDB

Volume 10

Publication date: 2017